The Role of Frame-Based Representation on the Semantic Web
نویسندگان
چکیده
A new architecture for the World Wide Web is emerging, known as the Semantic Web. In broad terms, it encompasses efforts to populate the Web with content which has formal semantics. This will enable automated agents to reason about Web content, and produce an intelligent response to unforeseen situations. We believe that in order to build the Semantic Web, the sharing of ontological information is required. This allows agents to reach partial shared understanding and thus interoperate. We acknowledge that the World Wide Web Consortium's RDF formalism (and the DARPA Agent Markup Language as its extension) can be seen as adhering to the frame-based representation paradigm, and we will further elaborate on the suitability of this paradigm for building ontologies as well as representing and sharing knowledge on the Web. The paper will discuss required and desirable features of ontological languages, giving examples of the possible usage of frame-based representation and ontologies on the Semantic Web. Introduction: Frame-based Representation Systems The term “Semantic Web” encompasses efforts to build a new world wide web (WWW) architecture that augments content with formal semantics thereby producing content suitable for automated systems to consume, as opposed to content intended for human consumption [Berners-Lee et al 2001]. The Semantic Web will allow us to use more automated functions (reasoning, information and service discovery, autonomous agents, etc.) on the Web, easing the workload of humans. (We probably need not repeat the well-known example here of using a search engine and getting back thousands of "hits" which then need to pruned by hand). The Semantic Web will also pave the way for true "device independence" and customization of information content for individual users, since the information on the Web would now be contained in a "raw form" and any (context-dependent) presentation could be rendered on demand. The success of the deployment of the Semantic Web will largely depend on whether useful ontologies will emerge [McGuinness 2001], allowing shared agreements about vocabularies for knowledge representation [KR]. Considering the Web community at large (as largely disjoint from the KR community), the acceptance for any particular KR technology or paradigm is of critical importance. It is the authors' belief that the simplicity (and ease of understanding) of frame-based representation would make it the paradigm of choice for knowledge representation when building the Semantic Web – given that we are attempting the deployment of “real” KR on a WWW-wide scale. If we offer the Web community a frame-based representation system that allows people to start modeling and (most importantly) sharing their models, some of these models will grow into useful ontologies (partially through a process of "natural selection") and we have taken a step closer to our goal. This observation appears to be shared by other efforts as well – we will provide our perspective on the developments to date and expand on them with our vision. The concept of a frame was proposed in the 1970's [Minsky 1975], and frame systems subsequently gained ground as basic tools for representing knowledge [Fikes & Kehler 1985, Karp 1992, Chaudhri et al 1998]. The fundamental idea of a frame system is rather simple: A frame represents an object or a concept. Attached to the frame is a collection of attributes (slots), potentially having types (or value restrictions) and potentially filled initially with values. When a frame is being used the values of slots can be altered to make the frame correspond to the particular situation at hand. According to an interpretation by Minsky, the slots of a frame might represent questions most likely to arise in a hypothetical situation represented by the frame. Frames are closely related to an earlier structure-based KR technique, called semantic networks [Woods 1975] which, in turn, are based on the idea of human associative memory [Quillian 1967]. Semantic networks may simply be thought of as data structures of nodes – "concepts" – and links – "associations" – between them. If one thinks of frames as concepts, and when other frames are used to fill slots we have an analogous framework. The notion of semantic networks also led to the early work on description logics [DL, Nardi et al forthcoming] as we know them today with the introduction of KL-ONE [Brachman 1977]. This work began with an emphasis on making term definitions in semantic networks more precise. Description logics provide representation and reasoning languages with precise semantics. They also limit language expressiveness so that reasoners can be built that can provide complete (and sound) inference in a tractable manner. Soon after its inception, the notion of a frame system was criticized as not introducing anything new to the field of KR; for example, Pat Hayes has said, "most of 'frames' is just a new syntax for first-order logic" [Hayes 1979]. Although this statement is easy to accept, it doesn't diminish the value of frame systems as easy-to-understand tools for simple KR (starting from what might be called "structural modeling"). An example of a frame-based system that argues both these points is Ontolingua [Farquhar et al. 1997]. It provides a frame-based syntax but then translates all information into KIF, which is just a first order logic encoding of the information [KIF]. Ontolingua has also been coupled with a theorem prover (initially ATP, later JTP) that provides reasoning (however, it can not guarantee tractability). There also exists a connection between frame systems and object-oriented programming (OOP) [Hynynen & Lassila 1989, Lassila 1990], particularly if we think of the "structural modeling" aspect mentioned above. The basic vocabulary is different, but what the terms denote are approximately the same, see table below: OOP Systems Frame Systems Description Logics instance frame, instance, individual instance, individual attribute, instance variable slot role, attribute value filler filler class, type frame, schema class, concept From the adoption viewpoint, it can be observed that many people understand OOP even if they have never heard of frame systems. We can think of frame systems very pragmatically through a "heuristic" interpretation (they are vehicles for storing knowledge and performing inferences), and depart from Minsky's "metaphysical" interpretation. In comparison to OOP systems, frame systems – as indicated above – typically embody some notion of reasoning. Frame system reasoning may sometimes be incomplete (i.e., there is no guarantee that everything that could be deduced from a given set of information may be deduced) and frame systems do not typically make guarantees about the computational tractability of their inference. Description logicbased systems typically provide information (many times proofs) concerning the tractability of their inference and if they do not provide complete inference, they typically provide a detailed discussion of what 1 The term “attribute” has sometimes been used to distinguish single valued roles from multi-valued roles. Attributes in these systems have a maximum cardinality of 1. kind of reasoning can be computed (e.g., [Borgida & Patel-Schneider 1994] and also provide precise semantics (typically denotational semantics) for the meanings of term expressions. RDF and What Is Missing From It The lack of means of sharing information with formal semantics inspired the development of the World Wide Web Consortium’s Resource Description Framework (RDF) metadata standard [Lassila & Swick 1999, Lassila 1998]. Expanding from the traditional notion of document metadata (such as something like library catalog information), RDF is suitable for describing any Web resources, and as such provides interoperability between applications that exchange "semantic" information on the Web. In the RDF model, knowledge is represented as directed labeled graphs (DLGs) where nodes and arcs are named using URIs (Universal Resource Identifiers) [Berners-Lee et al 1998]. Consequently, RDF can describe not just things on the Web (such as pages, parts of pages, or collections of pages) but also things not on the Web as long as they can be named using some URI scheme. Instead of viewing RDF as DLGs (in this respect they resemble semantic networks), one can take a more object-oriented view and think of RDF as a frame-based representation system by viewing the graphs as consisting of object/attribute/value triples (or resource/property/value triples in RDF vocabulary). In comparison, to extend the "translation" from the previous section, at the lowest level, object-oriented programming systems, frame systems, description logic-based systems, and RDF have much in common but again the terms we use are all different: OOP Systems Frame Systems Description Logics RDF instance frame, instance, individual instance, individual resource attribute, instance variable slot role, attribute property value filler filler property value class, type frame, schema class, concept class Descriptions in RDF can span multiple resources: values of properties can be other resources, and it is therefore possible to describe arbitrary relationships between multiple composite resources with structure. Properties themselves are also named by URIs and can be described: what are the permitted values of a particular property, which types of resources can it describe, and what is its relationship to other properties. Meaning in RDF comes from specific terms and concepts being defined and then named by URIs. Because URIs can be made unique, two systems can define some concept (say, "person") and can each use a different URI to name it to avoid "clashes"; on the other hand, two systems agreeing on a common concept will use the same URI and effectively "share" semantics. On top of the basic RDF model is layered an extensible, object-oriented type system (known as RDF Schema) [Brickley & Guha 2000]. The meta-constructs for the type system are terms and concepts named by URIs, so effectively RDF itself is used in representing and defining classes and properties. Class definitions can be derived from multiple superclasses. Property definitions can specify domain and range 2 Note that there also exist procedural languages which have been augmented with some type of reasoning capability (such as production rules). Given that this paper is focused on representation, these procedural languages (e.g., R++ [Litman et al 2001]) are considered out of scope. 3 The term “resource” is used for all nodes of an RDF DLG (including classes). 4 See the Appendix for a simple example of RDF Schema. constraints. One can also think of RDF Schema as a set of ontological modeling primitives on top of RDF. As such, the RDF Schema still needs work. Some of this has recently emerged in the form of the OIL system [Bechhofer et al 2000, Fensel et al 2001, OIL] and work continues within DARPA's DAML program [Hendler & McGuinness 2000]. Additionally, there is a new axiomatic semantics for RDF, RDFS, and the language resulting from the DAML program [Fikes & McGuinness 2001, McGuinness et al 2001].
منابع مشابه
A procedure for Web Service Selection Using WS-Policy Semantic Matching
In general, Policy-based approaches play an important role in the management of web services, for instance, in the choice of semantic web service and quality of services (QoS) in particular. The present research work illustrates a procedure for the web service selection among functionality similar web services based on WS-Policy semantic matching. In this study, the procedure of WS-Policy publi...
متن کاملQuery expansion based on relevance feedback and latent semantic analysis
Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
متن کاملUse of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems
One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...
متن کاملA Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملبرچسبزنی خودکار نقشهای معنایی در جملات فارسی به کمک درختهای وابستگی
Automatic identification of words with semantic roles (such as Agent, Patient, Source, etc.) in sentences and attaching correct semantic roles to them, may lead to improvement in many natural language processing tasks including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...
متن کاملAn Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)
Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...
متن کامل